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Abstract. We prove L p estimates (Theorem 1.2 ) for the "biest", a trilinear multiplier 
operator with singular symbol. The methods used are based on the treatment of the 
Walsh analogue of the biest in the prequel (h| of this paper, but with additional techni- 
calities due to the fact that in the Fourier model one cannot obtain perfect localization 
in both space and frequency. 



1. INTRODUCTION 

The bilinear Hilbert transform can be written (modulo minor modifications) as 
B(fi,h)(x):= [ MdU&y™^^ dZid£ 2 , 
where fi, f 2 are test functions on R and the Fourier transform is defined by 

KO ■= [ < 2;: "V(.r) dx. 

Jn 



From the work of Lacey and Thiele [13], [14] we have the following LP estimates on B: 



Theorem 1.1. B maps LP x LP — > U whenever 1 < p,q < oo, l/p+ 1/q = 1/r, and 
2/3<r<oo. 

In this paper we shall study a trilinear variant T of the bilinear Hilbert transform, 
defined by 

T(h, / 2) f 3 )(x) := f /i(6)/ 2 (6)/3(6)e 2 ^ {fl+?2+6) deid£,de s . (1) 

•/£l<6<?3 

The operator T arises naturally from WKB expansions of eigenfunctions of one-dimensional 
Schrodinger operators, following the work of Christ and Kiselev ; see Appendix I of [16 



for further discussion. For these applications it is of interest to obtain L p estimates on T, 
especially in the case when the functions fj are in L 2 . 
From the identity 



we see that T has the same homogeneity as the pointwise product operator, and hence 
we expect estimates of Holder type, i.e. that T maps L Pl x L P2 x L P3 to L Pi when 
l/p' 4 = 1/ Pl + l/pa + l/p 3 . 

If the restriction £i < ^ 2 < ^3 were replaced by some smoother cutoff then it would be 
possible to obtain these L p estimates from the paraproduct theory of Coifman and Meyer 
(see e.g. [|J). However, as for the bilinear Hilbert transform, the rough nature of the 
cutoff makes the treatment of this operator much more delicate. 

l 
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The operator T is connected to B in several ways. For instance, we have T(l, f 2 , f'3) = 
B(Pf 2 ,Pf 3 ), where P is the Riesz projection to the half-line [0, 00). Note that if we 
replace the restriction £1 < £ 2 < £3 by just £1 < £ 2 or £ 2 < £3, the operator T factors 
into B(fi, f 2 )fs or fiB(f 2 , / 3 ). Thus T can be thought of as a hybrid of the above two 
operators, although it does not seem to be as easily factorized as the preceding examples. 

The main purpose of this paper is to obtain a large set of L p estimates for T. As a 
by-product of our framework we shall be able to give a fairly short (but standard) proof 
of Theorem |1 . 1| . 

Let us consider now the 3-dimensional affine hyperspace 

S := {(«!, «2, CK3, CK4) £ R 4 I Oil + Oil + «3 + «4 = 1}. 

Denote by D' the open interior of the convex hull of the 12 extremal points Ai, A 12 . 
They belong to S and have the following coordinates: 

A 2 :(±,l,l,-§) A 3 : A 4 :(l,i,-|,l) 
4>:(l,-§,0,±) A 6 :(1,-|,|,0) A 7 :(§,-§, 0,1) A 8 : (|, -|, 1, 0) 
A 9 : (-|,1, 0,|) A 10 :(-|,1,|,0) A u : (-|, |, 1, 0) A 12 : ±, 0, 1). 



See |16| for a figure of this polytope. 



Also, denote by D" the open interior of the convex hull of the 12 extremal points 
Bi, 5 12 in S (they are not represented in the picture) where the coordinates of Bj are 
obtained from the coordinates of Aj after permuting the indices 1 and 3 for j = 1, 12 
(for instance B 2 has the coordinates 5 2 (1, 1,1/2, —3/2)). Then set D := D' fl D". 

Our main result is: 

Theorem 1.2. Let 1 < Pi,p 2 ,P3 < 00 and < p' A < 00 such that 

1111 

Pi P2 P3 Pa 

Then T maps 

T : L P1 x L P2 x L P3 -> L p 4 (2) 

as /ono as (l/p x , l/p 2 , l/p 3 , l/p 4 ) G D. 

In particular, T maps L Pl x L P2 x L P3 — > L p ' 4 , whenever 1 < P\,p 2 ,Ps < c>o and 
1 < p 4 < 00. 

For the particular application to Schrodinger eigenfunctions, the functions fi will be in 
L 2 . We thus record the corollary 

Corollary 1.3. T maps L 2 x L 2 x L 2 -> L 2 / 3 . 

In |l(| Theorem |1.2| was proved for a Walsh-Fourier analogue T^j^ p^q of T. From the 



point of view of time-frequency analysis the two operators are closely related, however 
the Walsh model is easier to analyze technically because it is possible in that model to 
localize perfectly in both space and frequency simultaneously. In the Fourier case one has 
to deal with several "Schwartz tails" which introduce additional difficulties. For instance, 
in the Walsh model an inner product (<pp, 0q) of wave packets vanishes unless the spatial 
intervals lp and Iq are nested; however in the Fourier model one needs to consider the 
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case when lp and Iq are separated (although the estimates improve rapidly with the 
relative separation of lp and Iq). 

If m is a function on R 3 , then we define the multiplier operator 

T m (fi,f 2 ,f 3 )(x) := f m(ei,e2,e3)/i(ei)7 2 (6)/3(e3)e 2 ™ (6+ « 2+&) ^i«3. 



In jT5[ such operators were studied, in particular estimates as in Theorem |1.2| were proved 
if m satisfies 

l^m(0|<|dist(^{A,A,A})|-l Q l (3) 

for all multi-indices a. 

As in [Tj| it shall be convenient to split T into two pieces, plus an error term, which 
is a multiplier operator of the type (||). Specifically, we let m'(£) be a function which 
coincides with the characteristic function X6<?3 011 a cone 

{|&-6I«I6-^ L ^I,6<6} 

and satisfies @ outside a smaller cone of the same type (different implicit constant). 
Likewise, we let m"(£) be a function which coincides with the characteristic function 
%i<6 on a cone 

and satisfies (|3|) outside a smaller cone of the same type. Then 

X&<6<Cs -m' - m" 

satisfies ([3]) everywhere and can be estimated^ by the results in |y| . The multipliers m' 
and m" will be specified later. 

Thus to prove our main theorem it suffices to show 

Theorem 1.4. Let 1 < pi,P2,P3 < oo and < p' A < oo swc/i i/iai 

1111 

Pi P2 P3 Pa 

Then T m i maps 

T m , : L P1 x L P2 x L PB -> L p 4 (4) 

as Zona as (1/pi, l/p2 ; 1/P3 5 1/P4) G D'. Similarly for T m » and D". 

We shall only prove the claim for T m / , as the claim for T m n follows by a permutation of 
the 1 and 3 indices. 

We remark that Theorem ( |1.2| ) is about a trilinear multiplier operator whose symbol is 
the characteristic function of the set £1 < £2 < £3- Modifications as in ]/[ give theorems 
for more general multiplier symbols. However, a product structure condition as discussed 
in M seems to occur. While these are questions of interest, we shall not discuss them 
further in this article. 

There are various recurring themes in the subject of multilinear singular integrals as in 
0' 0) EH' EH' E3' EH EH> an< ^ so forth' which the current paper again builds 

1 One can also bound T'" by the arguments in this paper, indeed the treatment would be simpler as 
the tri-tiles P and Q will be constrained to be roughly the same size. 
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up on. While the current article is mostly self contained, we will mark as "standard" any 
arguments that are well understood by now in this framework. Our notation and setup 
follows closely |15| and [16 . 



The authors would like to thank Mike Christ for pointing out to them the occurence 
of multilinear singular integrals of the type discussed in this article in the study of eigen- 
function expansions of Schrodinger operators. 

The first author was partially supported by a Sloan Dissertation Fellowship. The second 
author is a Clay Prize Fellow and is supported by grants from the Sloan and Packard 
Foundations. The third author was partially supported by a Sloan Fellowship and by 
NSF grants DMS 9985572 and DMS 9970469. 

2. Notation 

In this section we set out some general notation used throughout the paper. 

We use A < B to denote the statement that A < CB for some large constant C, and 
A <C B to denote the statement that A < C~ l B for some large constant C . Our constants 
C shall always be independent of P and Q. Given any interval (or cube) /, we let [/[ 
denote the measure of /, and cl denotes the interval (or cube) with the same center as I 
but c times the side-length. 

Given a spatial interval /, we shall define the approximate cutoff function xi by 



r 1/2 , 



where xj is the center of /. 

A collection {u>} of intervals is said to be lacunary around the frequency £ if we have 
dist(£,c<j) ~ \u\ for all u in the collection. 

We define a modulated Calderon-Zygmund operator to be any operator T which is 
bounded on L? and has the form 



Tf(x) = j K(x,y)f(y)dy 

where x, y G R, and the (possibly vector- valued) kernel K obeys the estimates 

\K(x,y)\<l/\x-y\ 

and 

\V x , y (e 2 ^ + y^K(x,y))\<l/\x-y\ 2 

for all x y and for some £, 77 G R. Note that a modulated Calderon-Zygmund operator 
is the composition of an ordinary Calderon-Zygmund operator with modulation operators 
such as / t— > e 2m ^' f. By standard Calderon-Zygmund theory (see e.g. [|T7|]) we thus see 
that T is bounded on LP for all 1 < p < 00, and is also weak-type (1, 1). 

3. INTERPOLATION 



In this section we review the interpolation theory from [15] which allows us to reduce 



multi-linear LP estimates such as those in Theorem |1.2| to certain "restricted weak type 



estimates. This will be the first reduction in our proof of Theorem 1.2 
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To prove the LP estimates on T' it is convenient to use duality and introduce the 
quadrilinear form A' associated to T' via the formula 



AU,/2,/ 3 ,/ 4 ) := / T\f x J 2l h){x)U{x)dx. 

The statement that T" is bounded from LP 1 x L P2 x L P3 to L p * is then equivalent to 
A' being bounded on L Pl x L P2 x L P3 x L P4 if 1 < p 4 ' < oo. For p 4 ' < 1 this simple 
duality relationship breaks down, however the interpolation arguments in will allow 
us to reduce @ to certain "restricted type" estimates on A'. As in [D| we find more 
convenient to work with the quantities a« = l/p%, i = 1,2,3,4, where pi stands for the 
exponent of L p \ 

Definition 3.1. A tuple a = «2, «3, a 4 ) is called admissible, if 

— OO < CKj < 1 

for all 1 < % < A, 



4 
i=l 

and there is at most one index j such that ctj < 0. We call an index i good if a, > 0, and 
we call it bad i/a, < 0. A good tuple is an admissible tuple without bad index, a bad tuple 
is an admissible tuple with a bad index. 

Definition 3.2. Let E, E' be sets of finite measure. We say that E' is a major subset of 
E if E' CE and \E'\ > 

Definition 3.3. If E is a set of finite measure, we denote by X(E) the space of all 
functions f supported on E and such that \\f\\oo < 1- 

Definition 3.4. If a = (aii, ct^i a 4 ) is an admissible tuple, we say that a 4-linear form 
A is of restricted type a if for every sequence E 1 , E 2 , E 3 , E^ of subsets of R with finite 
measure, there exists a major subset Ej of Ej for each bad index j (one or none) such 
that 

\Hfi, f 2 , fs, h)\ < \E\ a 

for all functions fi G X{E' j ), i = 1,2,3,4, where we adopt the convention E[ = Ei for 
good indices i, and \E\ a is a shorthand for 

\E\ a = i^H^M^sM^r 4 - 

The following "restricted type" result will be proved directly. 

Theorem 3.5. For every vertex A i} i = 1, ... ,12 there exist admissible tuples a arbi- 
trarily close to Ai such that the form A' is of restricted type a. 

By interpolation of restricted weak type estimates (cf. PH|| ) we thus obtain 

Corollary 3.6. Let a be an admissible tuple such that a G D'. Then A' is of restricted 
type a. 
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It only remains to convert these restricted type estimates into strong type estimates 
(H). To do this one just has to apply (exactly as in [15|) the multilinear Marcinkiewicz 
interpolation theorem |10 in the case of good tuples and the interpolation lemma 3.11 in 
fl5fl in the case of bad tuples. 

This ends the proof of theorem 1A . Hence, it remains to prove Theorem |3~5 . 

4. Discretization 

We now prove Theorem ETR The first reduction is to pass from the "continuous" form 
A' to a "discretized" variant involving sums of inner products with wave packets. This 
step is standard and appears essential in order for the phase plane combinatorics to work 
correctly. 

Definition 4.1. Let n > 1 and a G {0, §,§}"■ We define the shifted n-dyadic mesh 
D = D™ to be the collection of cubes of the form 

Dl := {T{k + (0, 1)" + (-l)V)|j G Z, k G Z n }. 

We define a shifted dyadic cube to be any member of a shifted n-dyadic mesh. 

Observe that for every cube Q, there exists a shifted dyadic cube Q' such that Q C ^0/ 
and \Q'\ ~ \Q\; this is best seen by first verifying the n = 1 case. 

Definition 4.2. A subset D' of a shifted n-dyadic grid D is called sparse, if for any two 
cubes Q,Q' in D with Q ^ Q' we have \Q\ < \Q'\ implies |10 9 OJ < \Q'\ and \Q\ = \Q'\ 
implies 10 9 Q n 10 9 Q' = 0. 

Observe that any subset of a shifted n-dyadic grid (with n < 3 say), can be split into 
0(1) sparse subsets. 

Definition 4.3. Let a = (cri, (j 2 , o~z) G {0, |, |} 3 , and let 1 < i < 3. An i-tile with shift o~i 
is a rectangle P = Ip x ujp with area 1 and with Ip G D\, up G D\. . A tri-tile with shift a 
is a 3-tuple P = (Pi, . . . , P3) such that each Pi is an i-tile with shift Oi, and the Ip i = Ip 
are independent of i. The frequency cube Qp of a tri-tile is defined to be Yl^i^Pi- 

We shall sometimes refer to i-tiles with shift a just as i-tiles, or even as tiles, if the 
parameters a, i are unimportant. 

Definition 4.4. A set P of tri-tiles is called sparse, if all tri-tiles in P have the same 
shift and the set {Qp: P e P} is sparse. 

Again, any set of tri-tiles can be split into 0(1) sparse subsets. 

Definition 4.5. Let P and P' be tiles. We write P' < P if I pi C I P and 3u>p C 3u)p>, 
and P' < P if P' < P or P' = P. We write P' < P if I P , C I P and 10 7 uj p C 10 7 u p/ . We 
write P' <'PifP'<P and P' £ P. 



The ordering < is in the spirit of that in Fefferman M or Lacey and Thiele [O], WA 



20| , but slightly different as P' and P do not quite have to intersect. This is more 



convenient for technical purposes. 

Definition 4.6. A collection P of tri-tiles is said to have rank 1 if one has the following 
properties for all P, P' G P: 
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If p ^ p' ; then Pj ^ Pj for all j = 1, 2, 3. 
If P- < Pj for some j = 1, 2, 3, then P[ < Pi for alll<i<3. 



// we further assume that \Ip,\ < 10 9 |/f;| ; £/ien we /iai>e P[ <' Pj /or a// i ^ j . 



Definition 4.7. Let P be a tile. A wave packet on P is a function (pp which has Fourier 
support in j^wp and obeys the estimates 

\Mx)\<\i P \- l/2 xi(x) M (5) 



for all M > 0, with the implicit constant depending on M. 

Heuristically, 0p is L 2 -normalized and is supported in P. 
The discretized form of Theorem |375| is as follows. 



Theorem 4.8. Let a, a' G {0, ~, |} 3 be shifts, and let P, Q be finite collections of multi- 
tiles with shifts a, a' respectively such that P and Q both have rank 1. For each i = 1, 2, 3 
and P G P ; let (pp. = (pp ui be a wave packet on Pi. Similarly for each i = 1,2,3 and 
Q G Q let (pQ i = (pQ^i be a wave packet on Qi. Define the form Ap ^ by 

A p,qUi, fa, fa, fa) := Yl I T Jl/2 M( B P*tf* fa), <f>P 2 ) (fa, (pp 3 ) 



PeP 



where 



Bp 2 (f 2 , fa) ■= \]^^ (fa,M{fa,<t>Q2)<i>Q 3 - 



L Q\ 



Then A is of restricted type a for all admissible tuples a G D', uniformly in the parameters 
a, o' , P , Q ; (pp., <pQ.. Furthermore, in the case that a has a bad index j , the restricted 
type is uniform in the sense that the major subset Ej can be chosen independently of the 
parameters just mentioned. 



This theorem is the Fourier analogue of Theorem 2.5 of JT6[. The main new difficulty is 
that unlike the Walsh wave packets (p wa ish,p, the Fourier wave packets (pp are not perfectly 
localized in physical space to the interval Ip. 



In the rest of this section we show how Theorem [3.5| can be deduced from Theorem |4| . 
This section can be read independently of the later sections, which are concerned with 
the proof of Theorem |4.8[ 

Assume Theorem ^4.8| holds. This Theorem is phrased in terms of wave packets which 
are perfectly localized in frequency and imperfectly localized in physical space. We now 
bootstrap this theorem to a similar statement in which no physical space localization is 
made. 

Definition 4.9. Let a G {0, ~, |} 3 be a shift. A collection Q C of cubes is said to 
have rank 1 if one has the following properties for all Q,Q' G Q; 

• If Q 7^ Q' , then Q C\Q' = 0. (In other words, the Q are disjoint). 

• If Q^Q', then Qi ^ Q'i for alii = 1,2,3. 

• IfZQ'j C 3Qj for some j = 1, 2, 3, then 10 7 Q^ C 10 7 Q< for all 1 < % < 3. 

• // we further assume that \Q'\ < |10 9 <5|, then we have 3Q[ fl 3Qi = for all i ^ j . 
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If Q is a cube in R 3 , denote by — Q the reflected cube about the origin; similarly for 
intervals in R. 

Corollary 4.10. Let a, a' G {0, |, |} 3 be shifts, let Q ; Q' be finite rank 1 collections of 
cubes in D\, D\, . For each i = 1,2,3 and Q G Q, let t\q i = r)Q ui be a bump function 
adapted to j^Qi- Similarly for each i = 1,2,3 and Q' G Q' let r]' Q , = tj'q, f 6e a bump 

function adapted to jqQ[- Then the form 

Hfij2,h,U) ■= J 5(6 + 6 + 6 + e4)m(6,6,6,e4)/i(ei)/2(6)/3(6)/4(e4)^i,-..,^4 

is of restricted type a for all admissible tuples a G D' 7 uniformly in the parameters a, a', 
Q, Q'; VQi, Vqo wh ere 

:= ^ J] %i(6)% 2 (6 + 6)^3(^4)^ (6)^ (6)^(6 + &)■ 

QgQ Q'£Q':-Q' 3 CQ 2 

Furthermore, in the case that a has a bad index j , the restricted type is uniform in 
the sense that the major subset E'- can be chosen independently of the parameters just 
mentioned. 

Proof: To motivate matters, let us first consider the simpler object 

for some cube Q G Q. By Plancherel this is equal to 

(/1 * VQi)(x)(f 2 * fj Qa )(x)(f 3 * fjQ 3 )(x) dx 

= KQ) 3/2 J (fl,<f>Qux)(f2,(f>Q 2 ,x){f3,(f>Q 3 ,x) dx 

where 

<P QjtX {y) := l(Q)- 1/2 fj Qj (x - y) 
and l(Q) is the side-length of Q. We can rewrite this as 

/ l J Pr 1/2 (/l ; 0Pi,t,l)(/2,0P 2 ,t,2)(/3,0P 3 ,t,3) dt 

Jo - 

P.Qp=Q 

— * _. 

where P ranges over all tri-tiles with frequency cube Q and spatial interval Ip in Dq, 
4>p h t,j is the function 

<t>Pj,t,j ■= (f)Q j: xp+\Ip\t 

and xp is the center of I p. Note that (f>p jt tj is a wave packet on Pj uniformly in t. 
Similarly, consider 

(h,b Q >) := J 5(6 + 6 + e3)^(6)^ 2 (6)^te)/i(6)/ 2 (6)/3fe)^i«3 

The multiplier m in the statement of the Corollary can now be rewritten as 

A(/i, f 2 , fs, h) = Yl f aq{T)b^{-T)dT 

QeQ,Q'eQ':-Q' 3 cQ 2 
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/ / E 771T7l(A'^i,t,i)( 5 P2,t'(/2,/3),</>P2,i,2)(/4,0P3,i,3) dtdt' 
•Jo Jo - „ „ \ 1 p\ 



P-QpCQ 



where 



?3,t',3- 

i J dr'~ 

Note that the collection of tri-tiles P has rank 1, and similarly for the collection of tri- 
tiles Q. Observe that we can get rid of the complex conjugation sign in the definition of 
Bp2,t' by redefining Q 3 to be — Q 3 and redefining <f)Q 3> t',3 accordingly; this also replaces 
the condition — ujq 3 C Up 2 by the condition uq 3 C up 2 , but it does not change the rank 
one property of the collection Q. The claim then follows by integrating the conclusion of 
Theorem O over t, t', using the uniformity assumptions of that Theorem. (The finiteness 
condition on P and Q can be removed by the usual limiting arguments.) ■ 

We can now prove Theorem [3.5| . By a standard partition of unity we can write 

(6,6,6) = E E ^-(6,6,6) 

CTe { ,I,§}3 QGQct 

whenever 6 + 6 + 6 = 0, where Q a C D\ is a collection of cubes which intersect the 
plane {£i + 6 + 6 = 0} and which satisfy the Whitney property 

10 3 diam(Q) < dist(Q, {2^i = 6, 6 + 6 + 6 = 0}) < 10 5 diam(Q) 

for all Q e Q CT , and for each cube Q G Q a , 4>Q tU is a bump function adapted to -^Q. Note 
that by refining by a finite factor if necessary one can make Q CT have rank 1. 
By splitting 4>q^ as a Fourier series in the 6 we can then write 

(6,6,6) = E c * E E 

^Ql,cr,fc,l(6) r ?Q2,T,fc,2(6) r ?Q3,CT,fc,3(6), 

fceza ae{o,\, f}3 QeQ CT 

where is a rapidly decreasing sequence and r]Q jt a,k,j is a bump function adapted to 
yjjQj uniformly in k (the c& are fractional powers of the Fourier coefficients, whereas the 
VQj,<j,k,j are products of an initial bump function, times a complex exponential, times 
another fractional power of the Fourier coefficient which generates the uniformity of the 
bump function in k). 
Similarly, we can write 

Xgl<&(6,6,6) = 7~J 4 22 E ^Qi^,fc,l(6)^Q 2 , (7 ,A : ,2(6)^Q3, (T ,fc,3(6)7 
fc€Z3 CTg { ,I,|}3QGQ CT 

As a consequence, the expression 

E CkC 'k' E E E VQi,a,k,l(£l)VQ2,a,k,2(& + &)VQ 3 ,a,kdU) 

^Q' 1!Qi ,fc',l(6)^Q 2 ,a,ifc',2(6)^,a,ifc',3(6 + 6) 

on the hyperplane 6 + 6 + 6 + 6 = is equal to \ L >< : . £ 3 when |6 - 61 < 

16 — 2 I' (under the latter constraint the condition — Q' 3 C Q 2 is automatic for nonzero 
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summands) , and satisfies (|j) outside any cone of the type |f 3 - &| < |& - &±&| (then 
the scales of the cubes Q and Q' are essentially coupled and also coupled to the distance 
to the line £1 = 6 = £3-) 

Thus the above expression is a multiplier m'(£i, £2, £3, £4) of the type requested for 
the definition of T'. Theorem |3.5| then follows from Corollary 4. 10| and summing in the 
parameters k, k', er, a', using the uniformity conclusions in Corollary |4.10| . (The finite- 
ness assumption on Q and Q' in Corollary [4.10| can be removed by the usual limiting 
arguments) . 

5. TREES 

The standard approach to prove the desired estimates for the forms Ap ^ is to organize 
our collections of tri-tiles P, Q into trees as in ||. We may assume and shall do so for 
the rest of this article that P and Q„ are sparse. 

Definition 5.1. For any 1 < j < 3 and a tri-tile Pt G P, define a j-tree with top Pt to 
be a collection of tri-tiles TCP such that 

Pj < PT,j for all P G T, 

where Ptj is the j component of Pt- We write It and utj for Ip T and uJp Tj respectively. 
We say that T is a tree if it is a j-tree for some 1 < j • < 3. 

Note that T does not necessarily have to contain its top Pp. 

Definition 5.2. Let 1 < i < 3. Two trees T , T' are said to be strongly i-disjoint if 

• PijLP! for all PET, P' G T. 

• Whenever P e T , P' G T' are such that 2up i fl 2u P i_ 7^ 0, then one has Ip, DIt — 0, 
and similarly with T and T' reversed. 

Note that if T and T' are strongly i-disjoint, then Ip x 2ujp i fl I pi x 2ujpi. = for all 
P G T, P' G T. 

Given that P„ is sparse, it is easy to see that if T is an i- tree, then for all P, P' G T 
and j 7^ % we have 

Up. = Up' 



3 



or 

2up. fl 2u P > 



6. Tile norms 

In the sequel we shall be frequently estimating expressions of the form 

1 E ij^S?«S«Si < 6 > 



PeP 

,(J) 



where P is a collection of tri-tiles and ap, are complex numbers for P G P and j = 1, 2, 3. 
In some cases (e.g. if one only wished to treat the Bilinear Hilbert transform) we just 
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have 

«)'.; (fro,,) (7) 

but we will have more sophisticated sequences a p. when dealing with Ap 5. 
In [HJJ the following (standard) norms on sequences of tiles were introduced: 

Definition 6.1. Let P be a finite collection of tri-tiles, j = 1,2,3, and let (op,-)p e p be a 
sequence of complex numbers. We define the size of this sequence by 



size^apjpgp) := sup(-— \a P .\ 

TCP \ lT \ 



|2U/2 



Per 



where T ranges over all trees in P which are i-trees for some i ^ j. We also define the 
energy of the sequence by 



energy^ (a Pj )p e p) := sup(^ \a P .\ 



|2U/2 



DCP PeD 

where D ranges over all subsets o/P such that the tiles {Pj : P G D} are pairwise disjoint. 

The size measures the extent to which the sequence ap j can concentrate on a single 
tree and should be thought of as a phase-space variant of the BMO norm. The energy is 
a phase-space variant of the L 2 norm. As the notation suggests, the number dp. should 

be thought of as being associated with the tile Pj rather than the full tri-tile P. 

In the Walsh model the energy is a tractable quantity; for instance, if ap j is given by 
(0) then one can control the energy by \\fj\\2 thanks to the perfect orthogonality of the 
Walsh wave packets. However, in the Fourier case the orthogonality is too poor to give a 
usable bound on the energy, and so we must instead use a more technical substitute. 

Definition 6.2. Let the notation be as in Definition \6. 1\ We define the modified energy 
of the sequence ( fl p j )p 6 p by 

^gyjdap^p^) := supsu P 2"(V \I T \)^ 2 (8) 

neZ T 

where T ranges over all collections of strongly j- disjoint trees in P such that 

(J2\apf) 1/2 >2 n \I T \^ 2 

Per 

for all TeT, and 

^Pf)^ < 2H+1 \ I T'\ 1/2 

Per 1 

for all sub-trees T' C T e T. 

The reader may easily verify that the modified energy is always dominated by the 
energy, and that we have the monotonicity property 

energy i ((ap 3 .)^ ; ) < energy^- ((a P] ) Pe p) 
whenever P'cP. From duality we see that 











I T \ 
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Lemma 6.3. Let the notation be as in Definition \6.% For any sequence {ap)p € p, there 
exists a collection T of strongly j-disjoint trees, and complex co-efficients cp j for all P G 
[J TgT T such that 

e°&Syj{(aPj)pe?) ~ I EE ap ^l' 

TGTp eT 

and such that 

£kp/< 

PeT' 

for allT G T and aZZ sub-trees T'CT o/T. 

Proof: Let n, T be an extremizer of (|8|), and take cp := 2 _n (^ TgT |ir|) _1 ^ 2 op.- for all 
Pe{J TeT T. m 

The usual BMO norm can be written using an L? oscillation or an L 1 oscillation, and 
the two notions are equivalent thanks to the John-Nirenberg inequality. The analogous 
statement for size is 

Lemma 6.4. Let P be a finite collection of tri-tiles, j = 1,2,3, and let (opjpvp be a 
sequence of complex numbers. Then 



size 5 ((a P .)^) ~ sup t^UV] \a Pj | 2 ^) 1/2 || l i,^ (/t) (9) 

TCP 1^1 p&r \lp\ 

where T ranges over all trees in P which are i-trees for some i ^ j . 



Proof: The same as in [16], Lemma 4.2 . ■ 

The following estimate is standard but we reproduce a proof in the Appendix for easy 
reference. This is the main combinatorial tool needed to obtain estimates on (|5]). 

Proposition 6.5. Let P be a finite collection of tri-tiles, and for each P G P and j = 

(i) 

1, 2, 3 let dp be a complex number. Then 

I E rj^ G ff a S a Sl S nsize i ((ag)^)^e^y,((ag i ) )^) 1 ^ (10) 
PeP i =1 

for any < 61,82,63 < 1 with 61 + 62 + 63 = 1, with the implicit constant depending on 
the 6j. 



Note that this Proposition is stronger than that the corresponding statement (JT 
Proposition 4.3) for the unmodified energy. 

Of course, in order to use Proposition |6.5| we will need some estimates on size and 
energy. In the rest of this section we give these estimates in the case when a^' is given 
by®. 

We begin with a standard variant of Bessel's inequality for Fourier wave packets. 
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Lemma 6.6. Let 1 < j < 3, let T be a collection of strongly j-disjoint trees in Q ; and 
for each Q G T G T let CQ j be a complex number such that 

J2_\c Qj \ 2 < A\I T ,\ (11) 
for allT G T and sub-trees X" C T ofT, and some A > 0. Then we have 

iiEE c ^J^E^i) 1/2 - 

Proof: In the Walsh case this is immediate from Bessel's inequality. The argument in 
the Fourier case is more technical, however. 

We may assume all trees in this lemma are sparse. 

By squaring both sides, we reduce to showing that 

E E E WMfe^i^Ew 

T,T'eT QgT Q'eT' TeT 

We may assume that ujq . H ujq 1 . 7^ since the inner product vanishes otherwise. By 
symmetry we may thus assume that \ojq.\ < \ojqi.\. 

3 3 

From the decay of the 4>q we have 
so it suffices to show that 

2^ 2_, I c ^I c q;It/jt/^ 1 + — rrq — ) ^ a 2^\ i t\- 

Let us first consider the portion of the sum where \ujq.\ ~ |^q^.|- In this case we 
estimate |cq,.||cq^| < \cq\ 2 + Icq'J 2 - We treat the contribution of the first term (cqJ 2 , as 

the second is similar. For each fixed Q, there are only 0(1) many candidates u to appear 
as ujq', satisfying all the above conditions, and for each fixed u the T", Q' summations have 

disjoint spatial intervals 1^,. One can then perform the T',Q' summations and estimate 
this contribution by 

EEM 2 

which is acceptable by fllTl) . 

It remains to consider the contribution when \ujq.\ <C \&q'.\- From (|TT|) applied to the 

singleton trees {Q}, {Q'} we have 
It thus suffices to show that 

EE E + d "^: / » ) )- m < N 
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for all trees T G T. 

From the assumptions on Q and Q' and sparseness of the trees we see that the tree 
T' which contains Q' must be distinct from T. By strong j-disjointness this implies that 
Ia, R It — 0- Also from strong j-disjointness we see that the Ia, are disjoint. We thus 
have 

^-^ dist(/^,/^,) 1nn r dist(/^,x) 

£ I W + m ) 00 S / (1 + n ) dx 



Q'GT':a;o j .na; g ,^0;|/ (3 |>|J (5 ,| 



and the claim then follows by summing in Q. 
As a consequence we have 



<|/rfl(l + 



distal*) 



Lemma 6.7. Lei j = 1, 2, 3, / 3 - fre a function in L 2 (\R), and let P be a finite collection 
of tri-tiles. Then we have 

^EgyjWj^pJWp) < Wfjh- (12) 



Proof: By Lemma |6.3| we may find a collection of strongly j-disjoint trees T, and 
complex co-efficients Cp j for all P G Utst such that 

energy Mfj,^)) pep) ~ ^^c Pj ^)l, 

and such that 



E 











7 T | 



PeT' 

for all T G T and all sub-trees T' C T of T. The claim then follows from Cauchy-Schwarz 
and Lemma |6.6| . ■ 

Lemma 6.8. Let j = 1,2,3, Ej be a set of finite measure, f) be a function in X(Ej), 
and let P be a finite collection of tri-tiles. Then we have 

wXj(((fjAp s ))p e p) ~ SU P % \ P ( 13 ) 
for all M, with the implicit constant depending on M. 

Proof: This is essentially Lemma 7.8 in |TJ| (see also Lemma 4.5 of [|16||), but we give a 
proof here for completeness. 

By Lemma |b.4] it suffices to show the estimate 



;Ei^^)i 2 n^) 1/2 ii^^ / 

- \-ip\ Je 



PeT 

for all i j and z-trees T. 

Fix T. By frequency translation invariance we may assume that u>tj contains the origin. 
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Let us first assume that fj is supported outside of 2It- From the decay of <pp j we have 

l</i,^>l<(|^[) M |/T|- 1/2 f Xt (14) 
Applying this estimate, we obtain 



:EK/-^)i 2 ^T) 1/2 n^i^r 1/2 / 

- _ \ 1 P\ JEj 



P&T 

and the claim follows from Holder. 

Now suppose that fj is supported on 2I T . It suffices to show that 

l{(£ \{fM?^r\) 1/2 % «>l S ^\E 3 n 2J T |. (15) 

rp 

PeT 

for all a > 0. 

Since cupj- contains the origin, we see that the vector-valued operator 

l J pl 

is a Calderon-Zygmund operator, and is hence weak-type (1, 1); note that the L 2 bound- 
edness of this operator follows from the almost orthogonality of the (pp.. (For more general 
Utj this operator would be a modulated Calderon-Zygmund operator). The claim follows 
from standard Calderon-Zygmund theory. ■ 

In the next section we shall show how the above size and energy estimates can be 
combined with Proposition 6J5 and the interpolation theory of the previous section to 
obtain Theorem To prove the estimates for the trilinear operator Tpq we need some 
more sophisticated size and energy estimates, which we will pursue after the proof of 



Theorem 1.1 



7. Proof of Theorem |1.1| 



We now sketch a proof of Theorem |1 . 1| . The proof here is standard, but we give it here 
for expository purposes, and also because we shall need Theorem [O] to prove the size 
and energy estimates needed for Theorem R3. 



We dualize B into the trilinear form Abht defined by 



ABHT(/l, / 2 , h)(?) := / /l(£l)/2(6)/ 3 (&W£l + 6 + 6) #1^2 #3, 

with 5 denoting the Dirac delta. 

By standard discretization arguments as in Section [| we may reduce the study of A BHT 
to that of discretized operators of the form 

a bht.p '■= \ t iw (h>M(h>M(h>M ( 16 ) 

PeP 

— * 

where P is some finite collection of tri-tiles of rank 1. 

We shall use the notation of Section ^|, with the obvious modification for trilinear 
forms as opposed to quadrilinear forms. From the interpolation theory in [15[ it suffices 
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to show that A BHT p is of restricted weak type o for all admissible 3-tuples (0:1,02,03) 
in the interior of the hexagon with vertices given by the six possible permutations of 
(1, 1/2, —1/2). By symmetry and interpolation it suffices to prove restricted weak type o 
for admissible 3-tuples o arbitrarily close to (1, 1/2, —1/2), so that the bad index is 3. 

Fix P, o as above, and let E\, E 2 , E 3 be sets of finite measure. We need to find a 
major subset E' 3 of E 3 such that 

|A BHT) p(/ a ,/2,/ 3 )| <\E\° 

for all functions /j G X(E^), i = 1, 2, 3. 
Define the exceptional set Q by 

3 

n:={J{M XEj >C\E 3 \/\E 3 \} 

i=i 

where M is the dyadic Hardy-Littlewood maximal function. By the classical Hardy- 
Littlewood inequality, we have \Q\ < \E 3 \/2 if C is a sufficiently large constant. Thus if 
we set E' 3 := E 3 \ Q, then E 3 is a major subset of E'3. 
Let fi G X(E' i ) for i = 1, 2, 3. We need to show 

I V 1 n (1) n ( V 3) \ <\F\ a 7^ 
\ \l/2 a Pi a P2 a P3 I ~ 1^1 U'J 



where a^p. is defined by 



We shall make the assumption that 

dist(/^,R\fl) fc 

m ~ 

l -'pi 

for all P G P, for some > independent of P, and prove (|l7f) with an additional factor 
of 2~ fc on the right-hand side. If we can prove ([17]) in this special case with the indicated 
gain, then the general case of (|T7D follows by summing in k. 
Fix k. By the definition of Q we have 



I^pI ~ \E 3 \ 



for all P G P and j = 1,2 and M > 1, while 

Ie' Xi 



\Ip\ 



< 2 (-M+C)fc 



for all P G P and M ^> 1 (of course, the implicit constant depends on M). 
From Lemma |6.8| we thus have 

size J ((af) PeP )<2 kl -^ 



3 



for j = 1,2, while 

size 3 ((ag) p ^) < 2~^ 

for any M. 
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Also, from Lemma |6.7| and the fact that fj £ X(E'a) we have 

&^ s ((a®)p&)<\E s \W 

forj = 1,2,3. 

From Proposition |6^ and a suitably large choice of M depending on the 9j we thus 



have 

3 



ej)/2/i%l 



LB, 



iEOT^gi^2-ni^i (i - 

for any < 61,82,63 < 1 such that 61 + 6 2 + 6 3 = 1. The claim then follows by choosing 
61 := 2a\ — \, 62 '■= 2a 2 — 1, and 63 := 2a 3 + i; note that there exist choices of a arbitrarily 
close to (1, 1/2,-1/2) for which the constraints on 6\, 62, 3 are satisfied. This concludes 
the proof of Theorem |L1[ 

8. Energy estimates 

The purpose of this section is to prove some additional energy estimates in the spirit 
of Lemma IO. 



From Lemma |6.6] and the Cauchy-Schwarz inequality we have 

Corollary 8.1. Let T be a collection of strongly 3-disjoint trees in Q ; and for each Q £ 
T £ T let cq 3 be a complex number such that 

Qef |/t| 

for all f C T £ T . Also, let T' be a collection of 1- disjoint trees in P, and for each 
P £ T' £ T' let dp 1 be a complex number such that 

- - Z^T'GT' \ 1 T'\ 

Per fc 
for all T C T' £ T'. T/ien we /ia^e 

1 E EEE^^W^i- (20) 

T'eT' p eT , TeT q sT 



In the rest of this section we shall prove (|20|) when the constraint C cjpj has been 
inserted into the summation. In [|l6j this was accomplished by a geometric lemma (||16||, 
Lemma 6.1) which allowed one to decouple the uq 3 C up 1 constraint assuming a priori 
that Pi n Q3 0. In the Walsh case this assumption was reasonable, however in the 
Fourier case we do not have perfect orthogonality in space and so we can only assume 
that ujp 1 fl ljq 3 7^ a priori. However, we still have the following weaker analogue of |16| , 
Lemma 6.1 when the Q tri-tiles are constrained to a tree. 

Lemma 8.2. Let T C Q be a (sparse) i-tree for some i = 1,2, and define the collection 
P' C P of tri-tiles by 

P' := {P £ P : u Qa C u Pl for some Q £ T}. 
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Then, if Q G T and P G P are such that (<pp 1 , 4>q 3 ) 7^ 0, then 

uq 3 C u Pl Pe P'. 

Proof: If ((ftp,, (pQ 3 ) 7^ 0, then ujq.^ Hup, 7^ 0. The claim then follows from the sparseness 
of T. m 



Lemma 8.3. Let the notation be as in Corollary \8. | . Then we have 

i E E E E ^^(^,^3)1 < 1. (21) 

T'GT' p eT / TGTQ eT:WQ3 c Wpi 

Proof: 

We can divide into the cases 

El^El^'l (22) 

TGT T'GT' 

and 

Ei^Ei^i- ( 23 ) 

T'GT' TGT 

In the case (ESI) we use (1201) to reduce (1211) to 



E E E E c Q 3 d Pi(0Pi,M \ ^ L 

TGT g 6T T'GT' p eT , :Wpi c^ Q3 



But the proof of this estimate is essentially the same as ( pT|) with the roles of P and Q 
reversed. Thus it suffices to prove (^T|) under the assumption (^%). 

We first consider the set of all pairs uq s C uj Pi such that ut,3 <f. 2up,, where T is the 
tree in T containing Q. These constraints imply 10 9 1^(33 1 > \oJp,\. By splitting into 0(1) 
cases we may assume that the ratio between \uq 3 \ and l^pj is fixed. We may also assume 
that the distance of Iq and Ip is 2 fe |/Q| for some fixed k, provided we prove the final 
estimate with an extra factor of 2~ k . However, then we have 



5 >T^0 



only for a bounded number of essentially unique P for any given Q, and for those P we 
have 

Hence we can estimate the corresponding piece of (PH ) using Cauchy-Schwarz by 

2- fc (EEi c ^i 2 ) l (E Ei^H^ 2 "* 

TGTg gT T'GT'p eT , 

Now we consider the pairs (P,Q) with ut,3 C 2,up 1 , where T is the tree containing Q. 
We estimate the corresponding part of the left-hand side of (|2~T|) by 



EE'^ikE E dpi<M>0Q3/ 

TeT QGT T 'e T ' PeT':UQ 3 Cu Pl ,ijT,3C2uJp 1 
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By fl22|), it suffices to show that 

QeT r '6T' P£T':UQ 3 Ctjp 1< L> T ,3C2cjp 1 



< 





h 


1 


(z~2peT 


hi) 1 / 2 (Y^T'eT 1 


It'\Y' 2 



(24) 



for each T G T. 

Fix T. Let us first estimate the contribution of the case when lp D 2J^ 7^ 0. Define the 
collection Pt by 

P T := {P G [J T' : lp n 2J T ^ 0; cj t , 3 C 2u; Pl ; cu Q3 C u Pl for some Q 3 G T}. 

T'eT' (25) 



By Lemma |8.2| we may rewrite the contribution of this case to ( P4|) as 

QeT 

where 

h T := ^ d Pl 4> Pl . 
PeP T 

By Cauchy-Schwarz and (|18|) we can bound the previous by 



^) 1/2 (Ek^^)i 2 ) v 



1/2 

V«i j 7^3/1 I 

QeT 

The 0q 3 are almost orthogonal as Q G T varies, so we can bound this by 

(^W /2 IMI 2 . 

Z^TeT \ 1 T\ 

It thus suffices to prove 



\\h T h< ^ ' |W2 - (26) 

We write the left-hand side as 

II J] ^i^Pilh- 

T'eT' p eT 'nP T 

We now consider each {P} with P G Pp as a tree by itself. By strongly 1- disjointness 
of the tree T' we see that lp with P G Pr are pairwise disjoint. Moreover, they are 
contained in 3Ip. In particular we have J2p^p T \Ip\ ~ \^t\, and the claim ( p6|) follows 
from Lemma |6.6| . This concludes the treatment of the case lp n 2/^ 7^ 0- 

To finish the estimation of ( p4"D it remains to treat the contribution of the case lp fl 
2 k Ip = and lp fl 2 k+1 Ip 7^ for each k > 0, with an additional factor of 2~ k on the right 
hand side. 
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Fix k > 0. In this case we use the crude estimate 



\cq. 



< 

3 I ~ 



from (|T8|), and reduce to showing that 

|<E E 



dp. 



T'eT' p e T':WQ 3 Cwp 1 ;a; T ,3C2ajp 1 ;/ i 3n2 fc / T =0;7 j3 n2 fe + 1 / T ^0 



,10 







1/2 




l^|) 1/2 



for each QeT. 
Fix Q. We split 



into 4>Q 3 X2 k ~ 1 i T an d 0q 3 (1 — X2 fc - 1 / T )- To control the former 



contribution we use the crude estimates 

\d Pl \< 

from (|19|) and 







1/2 


(X/T'eT' 





l/i / \i < -ioofe/ iQUioo/ IfpKioo 

and sum crudely in P. To control the latter contribution we observe that 4>Q 3 X2 k ~ 1 i T has 



an L 2 norm of O(2- 100/c (^|) 100 ), so it suffices to show that 

ii E E 

T'eT' p eT /. WQ3 Ca,p l ;/-n2 ft 7 T =0;/^n2 fe + 1 7 T ^0;w Ti 3C2wp 1 

But this follows by repeating the proof of 
From Lemma 18.3 and Lemma 16.31 we have 



< 2 50k ( !i2-±) 50 





I T \ 


1/2 




|/ T ,|)V2 



Corollary 8.4. Let T be a collection of 3- disjoint trees in Q, and for each Q G T G T 
let cq 3 be a complex number such that ([Tj|) holds for all T C T G T . T/ien 

energy^^ c Q3 (0 Pl , 4>Q 3 ))p e p) < 1. 

9. Additional size and energy estimates 

In the expression Ap ^ the Q tiles in the inner summation have a narrower frequency 
interval, and hence a wider spatial interval, than the P tiles in the outer summation. 
Thus the inner summation has a poorer spatial localization than the outer sum. It shall 
be convenient to reverse the order of summation so that the inner summation is instead 
more strongly localized spatially than the outer summation. Specifically, we rewrite Ap g 
as 



Ap q(/i, f'2, f'3, U) - ^2 ITU 



(1) (2) (3) 
/2 a Qi a Q2 a Q 3 



QeQ 
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where 



a Q [ ■= (fuM 

a { Q ] 2 := (f2,4>Q 2 ) 

qI '= Yl I r 1 1 /2 (/3, 0P 2 ) (/4, 0P 3 ) (<f>Px > M • 



The purpose of this section is to prove analogues of Lemma |6.8| and Lemma |6.7| for Oq\ 

Lemma 9.1. Let Ej be sets of finite measure and fj be functions in X(Ej) for j = 3,4. 
Then we have 

-.((■gW S -P f%^V" f )' (28) 

for any < 9 < 1 and M > 0, with the implicit constant depending on 9, M. 
Proof: By Lemma it suffices to show that 



f yM r ~M 

|_(3)i2 Xl Q \l/2 II <|r I „„„( jE 3 Xl \l-Of jE 4 Xl <3 



QeT 



for any i = 1, 2 and any z-tree T. We may assume (as in the proof of Lemma |6.§| ) that T 
contains its top Pt, in which case we may reduce to 

ll(El a Sl 2 ^7) 1/2 H^)^( / Xf^if xfj- (29) 

Fix T. To prove ( p9|) , first consider the relatively easy case when / 3 vanishes on 5/t. 
In this case we shall prove the stronger estimate 

i«Si<i%r 1/2 (/ x&YAf xU (so) 



E 3 Q JE 4 Q 



< \lJ-l/2(L9±)M(l-6)( / ~M\l-6 



Xl T ) { Xl T 
E 3 JE 4 



~M\0 



for all QeT\ the claim ( ^9|) then follows by square- summing in Q. 
We now prove (30). Fix Q 6 T. By (p7|) and (||) we may estimate 



f Xj 



T-lOOAf 

A/ p -100M 
A/, • 



PeP ;u)q 3 Cupj 

Interchanging the sum and integral and applying Cauchy-Schwarz we thus have 

I _(3) I < f \q f || a f I-100M 
l a Q 3 l ~ / l^2/3|P3j4|X/ (3 

where for j = 2, 3, the square function Sj is the vector-valued quantity 

Xj° M 

S jf '■= ((/> WjyqT^PjwgjCw^" 
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To show (30), it thus suffices by Holder to prove the weighted square-function estimate 

II^/IIl^x} 00 " dx) < dx) ( 31 ) 

Q Q 

for all 1 < p < oo and j = 2, 3. But this follows since Sj is a modulated Calderon- 
Zygmund operator^] whose kernel Kj(x,y) decays like 0(|/ ( 5|~ 1 (|x — 2/|/|/q|)~ 50M ) for all 

\ x ~y\ > I J qI- 

This proves fl29|) when / 3 vanishes on 5It- A similar argument gives ( p9|) when / 4 
vanishes on 5/^- We may thus reduce to the case when / 3 , / 4 are both supported on 5It- 
We may then assume that E 3 , E A C 5 It- 

Define the collection P' C P of tri-tiles by 

P' := {P G P : cjq 3 C u Pl for some Q G T}. 
From Lemma 18.2 and (W7l) we have 



PeP' 



= (Bpr(f3,f4),(t>Q 3 ) 

where 

Bp,(f 3 ,h) ■= Yl I r 1 1/2 (/3 , 0P 2 ) (A , 0P 3 ) 0Pi ■ 



PeP' 

To prove (|29|) it thus suffices to show that 



,~ioo 
, a/- 



(EK fi P'(^/4),0Q 3 )| 2 T^) 1/2 |U^ < I^M^ 

QeT Q 



The vector-valued operator 



/-((/^-fr)^ (32) 
is a modulated Calderon-Zygmund operator, so it suffices to show that 

||Sp,(/ 3 ,/4)||l<||/3||l/ M ||/4||l/ e . (33) 

But this follows from Theorem |1.1| (or more precisely, the analogue of Theorem |1.1| for 
the discretized operator Bp,). This finishes the proof of (|28|) . ■ 

The analogue of Lemma |6.7| is 

Lemma 9.2. Let Ej be sets of finite measure and fj be functions in X(Ej) for j = 3,4. 
Then we have 

^ S ((fl§W< (^ 4|1/2 ^W) ( |£3|1/2 ?pW) ^ 



2 This is ultimately because the constraint ojq 3 C Wp i; combined with the rank 1 hypothesis, forces the 
intervals top. to be lacunary around some frequency £. 
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for any < 9 < 1 and M > 0, with the implicit constant depending on 9, M. In particular, 
we have 

eSS&aCO^W £ |£4| (1 -* )/2 |£ 3 r (35) 
for any < 9 < 1, with the implicit constant depending on 9. 



Proof: By Lemma 5.3, it suffices to show that 





If 









lEE a ^l< fi^| 1/2 sup%^V f|^rsup%^) (36) 
for all collections T of strongly 3-disjoint trees, and all co-efficients cq 3 such that 

Qef 

for all f C T e T. 

Fix T, cq 3 . By ( f27| ) we may write the left-hand side of ( |3"6| ) as 

|Et71I72 6 S^ 3 '^)^'^3)I 

- - rp 
PeP 

where 

6 !? := E E <fo> (37) 
The claim then follows from Proposition |6.5|, Corollary |8.4j, Lemma |6.7|, and Lemma 



10. Proof of Theorem 3.5 for A 5 , . . . , A u 



Let o = (0:1,0:2,03,04) admissible tuples near A, for some 5 < i < 12. We will only 
consider those vertices with bad index 1 (i.e. Ag, . . . , A12) as the other four vertices can 
be done similarly. Thus o has bad index 1. Let us also fix arbitrary sets E±, E2, E3, E4 of 
finite measure. 



As in the proof of Theorem |1.1| , we define 

4 

Q := \J{M XEj > ci^i/l^il} 

i=i 

for a large constant C, and set E[ := E x \ Q. We now fix fa e ^(E^) for z = 1,2, 3, 4. Our 
task is then to show 



E l (1) (2) (3) I < I p |q /« 



QeQ 

where the Oq. are defined by 
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As before, we may make the assumption that 

dist(jg,R\n) 
m ~ 

for all Q G Q and for some k > independent of Q, provided that we gain a factor such 
as 2~ fc on the right-hand side of (BSI) . As before, we then have 

f V M r-i 

J Ej X Iq < 2fc |^| 



for all Q G Q and j = 2, 3, 4 and M > 1, while 



< 2 (-M+C)A; 



for all Q G Q and M > 1. 

With this assumption we have From Lemma ^8 and Lemma EO we thus have 

sizei((agJ)(5 e Q) < 2" 

^e 2 ((ag) (5ea )<2 fc 

^e 3 ((ag) (5ea )<2 fc 

for some < 9 < 1 which we will choose later. Similarly, from Lemma |6.7| , fl35|) and the 
hypotheses /j G X(Ej) we have 

eh^iy 1 ((ag i ) )^) < (i^ 
e£5^ 2 ((ag)^) < |£ 2 | 1/2 
e^y 3 ((ag)^)<|^|^ 2 |^| e/2 . 



(M+ 


C)k 






|#1 






l - e \Ei\ 6 


\Ei\ 



By Proposition |6.5| and choosing M sufficiently large we can thus bound the left-hand 
side of PD by 



2 -k 





(l+6»i)/2 


£J 2 


(l+02)/2(|£ 3 


1-6* 


#4 


e\(i+0 3 )/2 









for < 0i, 2) 9 3 < 1 such that 9i + 9 2 + 9 3 = 1. The claim follows by setting 6 1 ! := 2q;i + 1, 
9 2 := 2a 2 — 1, 6*3 := 2(a 3 + a 4 ) — 1, and 9 := a 4 / (a 3 + a 4 ); the reader may verify that the 
constraints on 9±, #2, 9 3 , can be obeyed for a arbitrarily close to A$, A w , An, A\ 2 . 



11. Proof of Theorem |375] for A 1j A 2 ,A 3 ,A 4 

Let a = (ai, a2, 03, 0:4) admissible tuples near Ai for some 1 < i < 4. We will only 
consider those vertices with bad index 4 (i.e. Ai, A 2 ) as the other two vertices can be 
done similarly. Thus a has bad index 4. Let us also fix arbitrary sets E\, E 2 , E 3 , E 4 of 
finite measure. 
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As before, we define 

4 

Q:= \J{M XE] >C\E 3 \/\E 4 \} 

3=1 

for a large constant C, and set E' A : = £4 \ Q. We now fix /, G -X'(-E'i) for z = 1, 2, 3, 4. Our 
task is then to show 



E uVM-Si * (39) 



where the Oq. are defined by 

(3) 

Recall that a Q is defined by 



(3) 
a Q3 : = 



XI I T )i/2 ^2) (A, 0P 3 )(0Pi, 0Q 3 )- 

rpl 

We may make the assumptions that 

dist(^,R\n) 
m ~ 

for all Q G Q and for some fc > independent of Q, and that 

dist(/ p ,R\fl) fc , 

m ~ 

for all P G P and for some fc' > independent of P, provided that we gain a factor such 
as 2~ k ~ k ' on the right-hand side of (p8|). 
As before, we then have 

JE i Xl Q < ofel^jl 

\ j q\ ~ 1^1 
for all Q G Q and j = 1, 2, 3 and M > 1, while 

for all Q G Q and M 3> 1 . From Lemma |6.8| we thus have 



^ei((a«) a6a )<2 fe|jLl 



l^4| 



^((ag)^) < 2< 



\E,\ 

From Lemma [9J] and the crude estimate f E — l-^pl we a l so have 
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From Lemma |6.7| we have 

e^y^ag)^) < {E^ 2 
eSS&atfag)^) < \E 2 \ 1 ' 2 . 
Finally, from the definition of k' we have 

Ie 3 < 2 fc 1-^31 



Kpl ~ 1-^4 



< 2(~M+C)k 



PI 



for all P G P. By (0) we thus have 

e-n^3((«g)Q e q) < 2-(^'|P 3 |^ 2 |P4| (e - 1)/2 

for some < 9 < 1 to be chosen later. 

By Proposition |0| we can thus bound the left-hand side of (|39|) by (if M is chosen 
sufficiently large) 





(i+< 


'l)/2 


1^2 


(l+0 2 )/2 




P 4 | 


1-03 



2 -fe 2 -A ; 'i£2j L_3j Q £ j 3 |(2-0)/2|_g 4 |(0-l)/2^1-03 



and the claim follows by setting 9\ := 2ai — 1, 9 2 := 2a 2 — 1, #3 : = 2(03 + a.4) + 1, and 
9 := (3a3 + 2a4)/(a3 + 04); the reader may verify that the constraints on 9%, 9 2 , 6*3, 9 can 
be obeyed for a arbitrarily close to A\, A 2 . 



12. Appendix: Proof of Proposition [675| 



We now prove Proposition |6.5| . This is the analogue of Appendix III in ||16|| , but using 
the necessary modifications for the Fourier case. Our arguments shall be modeled on 
those in [fR) |, Section 9, which were in turn inspired by [ ff^ . 



Fix the collection P and the collections dp'. We continue to assume P is sparse. 
We adopt the shorthand 



SIZEj := size i ((ag)p 6P ); ENERGYj := &uxgy j ((a%) Pf #). 

We may of course assume that a^p_ are always non-zero. We begin by considering the 
contribution of a single tree: 



Lemma 12.1 (Tree estimate). Let T be a tree in P, and Op. 6e complex numbers for all 
P € T end j = 1, 2, 3. JTien 



(1) (2) (3) I 
/2 fl Pi Q P 2 Q P 3 I 



< 



Per 



I It I size j((a 



Pj JPtTJ- 
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Proof: This proof is reproduced verbatim from [16 



Without loss of generality we may assume that T is a 3-tree. We then use Holder to 
estimate the left-hand side by 

l„(3)l 

El4^) 1/2 (£kgiY /2 su P 



Per Per 
From Definition 16.11 we have 



MA 1 / 2 ' 



(Eia^| 2 )^<|j r |V2 size . ((a a) )peT) 

PeT 

— * — * 

for j = 1, 2. Also, since the singleton tree {P} is a 1-tree with top P, we have 



\a 



(3) I 



l^pl 1/2 



size 3 ((ap 3 0p eT ) 



for all P G T. The claim follows. 



To bootstrap this summation over T to a summation over P we would like to partition 
P into trees T for which one has control over ^ T \Ip\. This will be accomplished by 

Proposition 12.2. Let 1 < j < 3 ; P' be a subset ofP, n G Z, and suppose that 

size jda^p^,) < 2~ n ENERGYj. (40) 

Then we may decompose P' = P" U P'" swc/i i/iat 

sizej^a^)^,,) < 2~ n ~ 1 ENERGY j (41) 



and i/ia£ P'" can be written as the disjoint union of trees T such that 

£ |/p| < 2 2 ™. (42) 
TeT 

Proof: The idea is to initialize P" to equal P', and remove trees from P" one by one 
(placing them into P w ) until fl4i"l) is satisfied. 

If P is a tile, let £p denote the center of up. HP and P' are tiles, we write P' < + P if 
P' <' P and £ P , > £ P , and P' <~ P if P' <' P and f P , < 

We now perform the following algorithm. We shall need a collection T of trees, which 
we initialize to be the empty set. We consider the set of all trees T of type i ^ j in P 
which are "upward trees" in the sense that 

Pj <+ P TJ for all P G T (43) 

and which satisfy the size estimate 

£|a P ; ) | 2 >2- 2n - 3 |/T|. (44) 

PgT 

If there are no trees obeying (|43|) and (|44]) , we terminate the algorithm. Otherwise, we 
choose T among all such trees so that the center £tj of ujp T j is maximal (primary goal), 
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and that T is maximal with respect to set inclusion (secondary goal). Let T' denote the 
j-tree 

V := {P G P\T : Pj < P TJ }. 

We remove both T and X" from P, and add them to T. Then one repeats the algorithm 
until we run out of trees obeying ( |43|) and fl44l) . 

Since P is finite, this algorithm terminates in a finite number of steps, producing trees 
Ti, T[, T 2 , T%, . . . , T M , T' M . We claim that the trees Ti, . . . , T M produced in this manner 
are strongly j-disjoint. It is clear from construction that T s fl T s > = for all s 7^ s'; by 
the rank 1 assumption we thus see that Pj ^ Pj for all P G T s , P' G T s i, s ^ s 1 . 

Now suppose for contradiction that we had tri-tiles P G T s , P' G T s i such that 2o>p. C 
2ujpi and I pi C J Ts . From the sparseness assumption we thus have \oup>\ > 10 9 |u;p j |. Since 
Pj < + PT a j an d Pj ^ + PT s ,,ji we thus see that £p T / j < (,p Tg j- By our selection algorithm 
this implies that s < s'. 

Also, since \ujpi\ > lO 9 )^!, Jpj C I Tg , and Pj < Pt s j we see that Pj < Pr s ,j- Since 

s < s', this means that P' G T s '. But T s ' and T s i are disjoint by construction, which is 
a contradiction. Thus the trees T s are strongly j-disjoint. From this, (|44D , (^0|), and 
Definition |6T^ we see that 

M 
s=l 

Since T' s has the same top as T s , we may thus add all the T s and T' s to T while respecting 
(0)- 

Now consider the set P of remaining tri-tiles. We note that 

\a { £\ 2 <2- 2n - 3 \I T \ (45) 



for all trees T in P, since otherwise the portion of T which obeyed ( 13 ) would be eligible 
for selection by the above algorithm. 

We now repeat the previous algorithm, but replace < + by <~ (so that the trees T are 
"downward-pointing" instead of "upward-pointing") and select the trees T so that the 
center is minimized rather than maximized. This yields a further collection of trees 
to add to T while still respecting (|42|) , and the remaining collection of tiles P has the 
property that 

\a { £\ 2 <2- 2n - 3 \I T \ (46) 
for all trees T in P. Combining (f45|) and ([46) we obtain (|4lD as desired. ■ 



From Proposition 12.2 we easily have 



Corollary 12.3. There exists a partition 



neZ 
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where for each n e Z and j = 1, 2, 3 we have 

size^ag)^,) < mm{2- n ENERGY^ SIZEj). 

Also, we may cover P n by a collection T n of trees such that 

reT n 

Proof: Since P is finite, we see that the hypotheses of Proposition |12.2| hold for all 
1 < j < 3 if n = —No for some sufficiently large N Q . Set the P ra to be empty for all 
n < —No- Now initialize n = —No and P' = P. For 1 < j < 3 in turn, we apply 
Proposition |12.2| , moving the tri-tiles in P w from P' in P ra and keeping the tri-tiles in P" 
inside P'. We then increment n and repeat this process. Since we are assuming the dp 

are non-zero, every tri-tile must eventually be absorbed into one of the P n . The properties 
are then easily verified. ■ 

From Corollary 12. 3 and Lemma 12.1 we see that 

1 3 
| TYZ\T^ a p! ai S ai S\ ^ \h\\{^m{2- n ENERGY^ SIZE,) 

for all T G T n . Summing over all T in T n and then summing over all n, we obtain 
1 3 

i £ |jqT72 a S )a S G Si s Y. 22n W mi < 2 ~ nENERGY ^ SIZE ^- 

Per n i =1 

Without loss of generality we may assume that 

ENERGYi < ENERGY 2 < ENERGY^ 
SIZE X ~ SIZE 2 ~ SIZE 3 
We may estimate the right-hand side as 

£ min(2 n ENERGY^ I ZE 2 S I ZE 3 , 

n 

ENERGYi ENERGY 2 SIZE 3 , 

2 ~ n ENERGYi ENERGY 2 , ENERGY^) 

which can be bounded by 
The claim then follows. 
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